智能论文笔记

Zero-shot hashtag segmentation for multilingual sentiment analysis

Ruan Chaves Rodrigues , Marcelo Akira Inuzuka , Juliana Resplande Sant'Anna Gomes , Acquila Santos Rocha , Iacer Calixto , Hugo Alexandre Dantas do Nascimento

分类：自然语言处理

2021-12-06

HashTag分段，也称为HashTag分解，是用于社交媒体数据集的预处理流水线的共同步骤。它通常先于情绪分析和仇恨语音检测等任务。对于中期到低资源语言的情感分析，以前的研究表明，一种多语言方法，即机器翻译的多语言方法可以竞争或优于任务的先前方法。我们开发了零拍摄具有零点的分割框架，并演示了如何用于提高多语言情感分析管道的准确性。我们的零拍摄框架为HASHTAG分割数据集建立了新的最先进的，甚至超过了以前的方法，依赖于在域内数据的特征工程和语言模型。

translated by 谷歌翻译

A Physics-Informed Neural Network to Model Port Channels

Marlon S. Mathias , Marcel R. de Barros , Jefferson F. Coelho , Lucas P. de Freitas , Felipe M. Moreno , Caio F. D. Netto , Fabio G. Cozman , Anna H. R. Costa , Eduardo A. Tannuri , Edson S. Gomi

分类：机器学习

2022-12-20

We describe a Physics-Informed Neural Network (PINN) that simulates the flow induced by the astronomical tide in a synthetic port channel, with dimensions based on the Santos - S\~ao Vicente - Bertioga Estuarine System. PINN models aim to combine the knowledge of physical systems and data-driven machine learning models. This is done by training a neural network to minimize the residuals of the governing equations in sample points. In this work, our flow is governed by the Navier-Stokes equations with some approximations. There are two main novelties in this paper. First, we design our model to assume that the flow is periodic in time, which is not feasible in conventional simulation methods. Second, we evaluate the benefit of resampling the function evaluation points during training, which has a near zero computational cost and has been verified to improve the final model, especially for small batch sizes. Finally, we discuss some limitations of the approximations used in the Navier-Stokes equations regarding the modeling of turbulence and how it interacts with PINNs.

translated by 谷歌翻译

Achieving Transparency in Distributed Machine Learning with Explainable Data Collaboration

Anna Bogdanova , Akira Imakura , Tetsuya Sakurai , Tomoya Fujii , Teppei Sakamoto , Hiroyuki Abe

分类：机器学习 | 人工智能

2022-12-06

Transparency of Machine Learning models used for decision support in various industries becomes essential for ensuring their ethical use. To that end, feature attribution methods such as SHAP (SHapley Additive exPlanations) are widely used to explain the predictions of black-box machine learning models to customers and developers. However, a parallel trend has been to train machine learning models in collaboration with other data holders without accessing their data. Such models, trained over horizontally or vertically partitioned data, present a challenge for explainable AI because the explaining party may have a biased view of background data or a partial view of the feature space. As a result, explanations obtained from different participants of distributed machine learning might not be consistent with one another, undermining trust in the product. This paper presents an Explainable Data Collaboration Framework based on a model-agnostic additive feature attribution algorithm (KernelSHAP) and Data Collaboration method of privacy-preserving distributed machine learning. In particular, we present three algorithms for different scenarios of explainability in Data Collaboration and verify their consistency with experiments on open-access datasets. Our results demonstrated a significant (by at least a factor of 1.75) decrease in feature attribution discrepancies among the users of distributed machine learning.

translated by 谷歌翻译

GET-DIPP: Graph-Embedded Transformer for Differentiable Integrated Prediction and Planning

Jiawei Sun , Chengran Yuan , Shuo Sun , Zhiyang Liu , Terence Goh , Anthony Wong , Keng Peng Tee , Marcelo H. Ang Jr

分类：机器人

2022-11-11

Accurately predicting interactive road agents' future trajectories and planning a socially compliant and human-like trajectory accordingly are important for autonomous vehicles. In this paper, we propose a planning-centric prediction neural network, which takes surrounding agents' historical states and map context information as input, and outputs the joint multi-modal prediction trajectories for surrounding agents, as well as a sequence of control commands for the ego vehicle by imitation learning. An agent-agent interaction module along the time axis is proposed in our network architecture to better comprehend the relationship among all the other intelligent agents on the road. To incorporate the map's topological information, a Dynamic Graph Convolutional Neural Network (DGCNN) is employed to process the road network topology. Besides, the whole architecture can serve as a backbone for the Differentiable Integrated motion Prediction with Planning (DIPP) method by providing accurate prediction results and initial planning commands. Experiments are conducted on real-world datasets to demonstrate the improvements made by our proposed method in both planning and prediction accuracy compared to the previous state-of-the-art methods.

translated by 谷歌翻译

Face Super-Resolution Using Stochastic Differential Equations

Marcelo dos Santos , Rayson Laroca , Rafael O. Ribeiro , João Neves , Hugo Proença , David Menotti

分类：计算机视觉

2022-09-24

传播模型已被证明对各种应用程序有效，例如图像，音频和图形生成。其他重要的应用是图像超分辨率和逆问题的解决方案。最近，一些作品使用了随机微分方程（SDE）将扩散模型推广到连续时间。在这项工作中，我们介绍SDE来生成超分辨率的面部图像。据我们所知，这是SDE首次用于此类应用程序。所提出的方法比基于扩散模型的现有超级分辨率方法提供了改进的峰值信噪比（PSNR），结构相似性指数（SSIM）和一致性。特别是，我们还评估了该方法在面部识别任务中的潜在应用。通用面部特征提取器用于比较超分辨率图像与地面真相，并获得了与其他方法相比，获得了卓越的结果。我们的代码可在https://github.com/marcelowds/sr-sde上公开获取

translated by 谷歌翻译

Visual Localization and Mapping in Dynamic and Changing Environments

João Carlos Virgolino Soares , Vivian Suzano Medeiros , Gabriel Fischer Abati , Marcelo Becker , Glauco Caurin , Marcelo Gattass , Marco Antonio Meggiolaro

分类：机器人

2022-09-21

完全自主移动机器人的现实部署取决于能够处理动态环境的强大的大满贯（同时本地化和映射）系统，其中对象在机器人的前面移动以及不断变化的环境，在此之后移动或更换对象。机器人已经绘制了现场。本文介绍了更换式SLAM，这是一种在动态和不断变化的环境中强大的视觉猛烈抨击的方法。这是通过使用与长期数据关联算法结合的贝叶斯过滤器来实现的。此外，它采用了一种有效的算法，用于基于对象检测的动态关键点过滤，该对象检测正确识别了不动态的边界框中的特征，从而阻止了可能导致轨道丢失的功能的耗竭。此外，开发了一个新的数据集，其中包含RGB-D数据，专门针对评估对象级别的变化环境，称为PUC-USP数据集。使用移动机器人，RGB-D摄像头和运动捕获系统创建了六个序列。这些序列旨在捕获可能导致跟踪故障或地图损坏的不同情况。据我们所知，更换 - 峰是第一个对动态和不断变化的环境既有坚固耐用的视觉大满贯系统，又不假设给定的相机姿势或已知地图，也能够实时运行。使用基准数据集对所提出的方法进行了评估，并将其与其他最先进的方法进行了比较，证明是高度准确的。

translated by 谷歌翻译

Evaluating Temporal Patterns in Applied Infant Affect Recognition

Allen Chang , Lauren Klein , Marcelo R. Rosales , Weiyang Deng , Beth A. Smith , Maja J. Matarić

分类：人工智能

2022-09-07

代理商必须连续监视其伴侣的情感状态，以了解和参与社交互动。但是，评估情感识别的方法不能说明在情感状态之间的阻塞或过渡期间可能发生的分类绩效的变化。本文解决了在婴儿机器人相互作用的背景下影响分类表现的时间模式，在这种情况下，婴儿的情感状态有助于他们参与治疗性腿部运动活动的能力。为了支持视频记录中面部遮挡的鲁棒性，我们训练了婴儿使用面部和身体功能的识别分类器。接下来，我们对表现最佳模型进行了深入的分析，以评估随着模型遇到丢失的数据和不断变化的婴儿影响，性能如何随时间变化。在高度信心提取功能的时间窗口期间，经过训练的面部功能的单峰模型与在面部和身体特征训练的多模式模型相同的最佳性能。但是，在整个数据集上评估时，多模型模型的表现优于单峰模型。此外，在预测情感状态过渡并在对同一情感状态进行多个预测后改善时，模型性能是最弱的。这些发现强调了将身体特征纳入婴儿的连续影响识别的好处。我们的工作强调了随着时间的流逝和在存在丢失的数据的存在时，评估模型性能变异性的重要性。

translated by 谷歌翻译

Non-readily identifiable data collaboration analysis for multiple datasets including personal information

Akira Imakura , Tetsuya Sakurai , Yukihiko Okada , Tomoya Fujii , Teppei Sakamoto , Hiroyuki Abe

分类：机器学习

2022-08-31

多源数据融合，共同分析了多个数据源以获得改进的信息，引起了广泛的研究关注。对于多个医疗机构的数据集，数据机密性和跨机构沟通至关重要。在这种情况下，数据协作（DC）分析通过共享维数减少的中间表示，而无需迭代跨机构通信可能是合适的。在分析包括个人信息在内的数据时，共享数据的可识别性至关重要。在这项研究中，研究了DC分析的可识别性。结果表明，共享的中间表示很容易识别为原始数据以进行监督学习。然后，这项研究提出了一个非可读性可识别的直流分析，仅共享多个医疗数据集（包括个人信息）的非可读数据。所提出的方法基于随机样本排列，可解释的直流分析的概念以及无法重建的功能的使用来解决可识别性问题。在医学数据集的数值实验中，提出的方法表现出非可读性可识别性，同时保持了常规DC分析的高识别性能。对于医院的数据集，提出的方法在仅使用本地数据集的本地分析的识别性能方面表现出了9个百分点的改善。

translated by 谷歌翻译

HTML版本

Another Use of SMOTE for Interpretable Data Collaboration Analysis

Akira Imakura , Masateru Kihira , Yukihiko Okada , Tetsuya Sakurai

分类：机器学习

2022-08-26

最近，已经开发了数据协作（DC）分析，以跨多个机构跨多个机构提供隐私的综合分析。 DC分析集中了单独构建的维度减少中间表示形式，并通过协作表示实现集成分析，而无需共享原始数据。为了构建协作表示形式，每个机构都会生成并共享一个可共享的锚数据集并集中其中间表示。尽管随机锚数据集对DC分析的功能很好，但使用其分布与RAW数据集的分布接近的锚数据集有望改善识别性能，尤其是对于可解释的DC分析。基于合成少数群体过度采样技术（SMOTE）的扩展，本研究提出了一种锚数据构建技术，以提高识别性能，而不会增加数据泄漏的风险。数值结果证明了所提出的基于SMOTE方法的效率比人工和现实世界数据集的现有锚数据构建体的效率。具体而言，所提出的方法在收入数据集的现有方法上分别实现了9个百分点和38个百分点的性能改进。提出的方法提供了SMOTE的另一种用途，而不是用于不平衡的数据分类，而是用于隐私保护集成分析的关键技术。

translated by 谷歌翻译

HTML版本

A First Look at Dataset Bias in License Plate Recognition

Rayson Laroca , Marcelo Santos , Valter Estevam , Eduardo Luz , David Menotti

分类：计算机视觉

2022-08-23

公共数据集在推进车牌识别（LPR）的最新技术方面发挥了关键作用。尽管数据集偏见在计算机视觉社区中被认为是一个严重的问题，但在LPR文献中很大程度上忽略了它。 LPR模型通常在每个数据集上进行训练和评估。在这种情况下，他们经常在接受培训的数据集中证明了强大的证明，但在看不见的数据集中表现出有限的性能。因此，这项工作研究了LPR上下文中的数据集偏差问题。我们在八个数据集上进行了实验，在巴西收集了四个，在中国大陆进行了实验，并观察到每个数据集都有一个独特的，可识别的“签名”，因为轻量级分类模型预测了车牌（LP）图像的源数据集，其图像的源95％的精度。在我们的讨论中，我们提请人们注意以下事实：大多数LPR模型可能正在利用此类签名，以以失去概括能力为代价，以改善每个数据集中的结果。这些结果强调了评估跨数据库设置中LPR模型的重要性，因为它们提供了比数据库内部的更好的概括（因此实际性能）。

translated by 谷歌翻译